Data Science Overview
Data Science, Machine Learning, and All of That
Data Science is fashionable, but what it is, is not quite so clear. The name was coined by Cleveland in 2001, who saw it as a modern way to practice Statistics. One of the central components of his proposal was “computing with data”, the topic of this book. Since then, the term “data science” has had many definitions and has points in common with “Big Data” and academic fields such as Information Science and Decision Science. It is a hybrid field with important components in Statistics, Computer Science, and Management.
The typical data analyst’s job consists of data acquisition, data wrangling, namely cleaning, pre-processing and integration of data, data analysis, hypothesis development and testing, and finally communication of the result. Often, data acquisition and wrangling are the most labor-intensive parts of the job.
Machine Learning is